Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Quantitative analysis of mathematical documents

Identifieur interne : 001380 ( Main/Exploration ); précédent : 001379; suivant : 001381

Quantitative analysis of mathematical documents

Auteurs : S. Uchida [Japon] ; A. Nomura [Japon] ; Masakazu Suzuki (mathématicien) [Japon]

Source :

RBID : Pascal:06-0054253

Descripteurs français

English descriptors

Abstract

Mathematical documents are analyzed from several viewpoints for the development of practical OCR for mathematical and other scientific documents. Specifically, four viewpoints are quantified using a large-scale database of mathematical documents, containing 690,000 manually ground-truthed characters: (i) the number of character categories, (ii) abnormal characters (e.g.. touching characters), (iii) character size variation, and (iv) the complexity of the mathematical expressions. The result of these analyses clarifies the difficulties of recognizing mathematical documents and then suggests several promising directions to overcome them.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Quantitative analysis of mathematical documents</title>
<author>
<name sortKey="Uchida, S" sort="Uchida, S" uniqKey="Uchida S" first="S." last="Uchida">S. Uchida</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Department of Intelligent Systems, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Nomura, A" sort="Nomura, A" uniqKey="Nomura A" first="A." last="Nomura">A. Nomura</name>
<affiliation wicri:level="4">
<inist:fA14 i1="02">
<s1>Department of Mathematics, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Suzuki, M" sort="Suzuki, M" uniqKey="Suzuki M" first="M." last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4">
<inist:fA14 i1="02">
<s1>Department of Mathematics, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">06-0054253</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 06-0054253 INIST</idno>
<idno type="RBID">Pascal:06-0054253</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000417</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000370</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000388</idno>
<idno type="wicri:doubleKey">1433-2833:2005:Uchida S:quantitative:analysis:of</idno>
<idno type="wicri:Area/Main/Merge">001418</idno>
<idno type="wicri:Area/Main/Curation">001380</idno>
<idno type="wicri:Area/Main/Exploration">001380</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Quantitative analysis of mathematical documents</title>
<author>
<name sortKey="Uchida, S" sort="Uchida, S" uniqKey="Uchida S" first="S." last="Uchida">S. Uchida</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Department of Intelligent Systems, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Nomura, A" sort="Nomura, A" uniqKey="Nomura A" first="A." last="Nomura">A. Nomura</name>
<affiliation wicri:level="4">
<inist:fA14 i1="02">
<s1>Department of Mathematics, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Suzuki, M" sort="Suzuki, M" uniqKey="Suzuki M" first="M." last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4">
<inist:fA14 i1="02">
<s1>Department of Mathematics, Kyushu University, 6-10-1, Hakozaki</s1>
<s2>Higashi-ku, Fukuoka-shi</s2>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Higashi-ku, Fukuoka-shi</wicri:noRegion>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">International journal on document analysis and recognition : (Print)</title>
<title level="j" type="abbreviated">Int. j. doc. anal. recognit. : (Print)</title>
<idno type="ISSN">1433-2833</idno>
<imprint>
<date when="2005">2005</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">International journal on document analysis and recognition : (Print)</title>
<title level="j" type="abbreviated">Int. j. doc. anal. recognit. : (Print)</title>
<idno type="ISSN">1433-2833</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Database</term>
<term>Document analysis</term>
<term>Ground truth</term>
<term>Mathematical formula</term>
<term>Optical character recognition</term>
<term>Quantitative analysis</term>
<term>Very large databases</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Analyse documentaire</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Base donnée très grande</term>
<term>Base donnée</term>
<term>Analyse quantitative</term>
<term>Réalité terrain</term>
<term>Formule mathématique</term>
<term>.</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Base de données</term>
<term>Analyse quantitative</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Mathematical documents are analyzed from several viewpoints for the development of practical OCR for mathematical and other scientific documents. Specifically, four viewpoints are quantified using a large-scale database of mathematical documents, containing 690,000 manually ground-truthed characters: (i) the number of character categories, (ii) abnormal characters (e.g.. touching characters), (iii) character size variation, and (iv) the complexity of the mathematical expressions. The result of these analyses clarifies the difficulties of recognizing mathematical documents and then suggests several promising directions to overcome them.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Japon</li>
</country>
<region>
<li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement>
<li>Fukuoka</li>
</settlement>
<orgName>
<li>Université de Kyūshū</li>
</orgName>
</list>
<tree>
<country name="Japon">
<region name="Kyūshū">
<name sortKey="Uchida, S" sort="Uchida, S" uniqKey="Uchida S" first="S." last="Uchida">S. Uchida</name>
</region>
<name sortKey="Nomura, A" sort="Nomura, A" uniqKey="Nomura A" first="A." last="Nomura">A. Nomura</name>
<name sortKey="Suzuki, M" sort="Suzuki, M" uniqKey="Suzuki M" first="M." last="Suzuki">Masakazu Suzuki (mathématicien)</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001380 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001380 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:06-0054253
   |texte=   Quantitative analysis of mathematical documents
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024